Attribution and its annotation in the Penn Discourse TreeBank
Abstract
In this paper, we describe an annotation scheme for the attribution of abstract objects (propositions, facts, and eventualities) associated with discourse relations and their arguments annotated in the Penn Discourse TreeBank. The scheme aims to capture both the source and degrees of factuality of the abstract objects through the annotation of text spans signalling the attribution, and of features recording the source, type, scopal polarity, and determinacy of attribution.
RÉSUMÉ. Dans cet article, nous décrivons un schéma d’annotation pour l’encodage des objets abstraits (propositions, faits et possibilités) associés aux relations de discours et à leurs arguments tels qu’annotés dans le Penn Discourse TreeBank. Ce schéma a pour objet la capture de la source et du degré de factualité des objets abstraits. Les aspects clés de ce schéma comprennent l’annotation des intervalles textuels signalant l’attribution, ainsi que l’annotation des propriétés caractérisant la source, le type, la polarité de la portée, et le degré de détermination de l’attribution.
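To make the shape of such an annotation concrete, here is a minimal sketch of a record holding the signalling text span and the four features named in the abstract. The class name, field names, offset convention, and feature values are all hypothetical placeholders chosen for illustration. They are not the PDTB's actual file format or label inventory.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Attribution:
    """Hypothetical record for one attribution annotation (illustrative only)."""
    # Character offsets (start, end) of the text signalling the attribution,
    # or None when no explicit span is marked. The offset convention is assumed.
    span: Optional[Tuple[int, int]]
    source: str           # who the abstract object is attributed to
    type: str             # kind of attribution relation
    scopal_polarity: str  # polarity scoping over the attributed content
    determinacy: str      # whether the attribution is determinate


def attributed_text(text: str, attr: Attribution) -> str:
    """Return the text signalling the attribution, or '' if there is none."""
    if attr.span is None:
        return ""
    start, end = attr.span
    return text[start:end]


# Made-up sentence and placeholder feature values, not taken from the corpus.
sentence = "Analysts said the deal will close soon."
example = Attribution(
    span=(0, 13),
    source="other",
    type="communication",
    scopal_polarity="none",
    determinacy="determinate",
)
print(attributed_text(sentence, example))
```

Storing the span as offsets rather than copied text follows the standoff style of annotation mentioned in the related papers listed here, though the exact representation above is an assumption.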
Related papers
Attribution And The (Non-)Alignment Of Syntactic And Discourse Arguments Of Connectives
The annotations of the Penn Discourse Treebank (PDTB) include (1) discourse connectives and their arguments, and (2) attribution of each argument of each connective and of the relation it denotes. Because the PDTB covers the same text as the Penn TreeBank WSJ corpus, syntactic and discourse annotation can be compared. This has revealed significant differences between syntactic structure and dis...
The Penn Discourse Treebank 2.0 Annotation Manual
This report contains the guidelines for the annotation of discourse relations in the Penn Discourse Treebank (http://www.seas.upenn.edu/~pdtb), PDTB. Discourse relations in the PDTB are annotated in a bottom up fashion, and capture both lexically realized relations as well as implicit relations. Guidelines in this report are provided for all aspects of the annotation, including annotation expli...
The Penn Discourse TreeBank 2.0
We present the second version of the Penn Discourse Treebank, PDTB-2.0, describing its lexically-grounded annotations of discourse relations and their two abstract object arguments over the 1 million word Wall Street Journal corpus. We describe all aspects of the annotation, including (a) the argument structure of discourse relations, (b) the sense annotation of the relations, and (c) the attri...
Annotation And Data Mining Of The Penn Discourse TreeBank
The Penn Discourse TreeBank (PDTB) is a new resource built on top of the Penn Wall Street Journal corpus, in which discourse connectives are annotated along with their arguments. Its use of standoff annotation allows integration with a stand-off version of the Penn TreeBank (syntactic structure) and PropBank (verbs and their arguments), which adds value for both linguistic discovery and discour...
Annotating Attribution In The Penn Discourse TreeBank
An emerging task in text understanding and generation is to categorize information as fact or opinion and to further attribute it to the appropriate source. Corpus annotation schemes aim to encode such distinctions for NLP applications concerned with such tasks, such as information extraction, question answering, summarization, and generation. We describe an annotation scheme for marking the at...
Journal: TAL
Volume: 47
Publication date: 2006